# Low-resource language optimization
Nllb1.3 Smugri4 V0.01
This is a version of the NLLB-1.3b model fine-tuned with parallel data for 29 Finno-Ugric languages, supporting the generation of multiple dialects/variants.
Machine Translation
Transformers Supports Multiple Languages

N
tartuNLP
39
2
Stt Bm Quartznet15x5 V0
This is a Bambara automatic speech recognition model fine-tuned based on the NVIDIA NeMo framework, suitable for Bambara speech-to-text tasks.
Speech Recognition Other
S
RobotsMali
88
1
Fish Speech 1.5 Ukrainian
A Ukrainian-specific speech synthesis model fine-tuned based on Fish Speech 1.5, supporting high-quality voice generation for 55 speakers
Speech Synthesis Other
F
skypro1111
43
4
Whisper Small Uzbek
Apache-2.0
Uzbek automatic speech recognition model fine-tuned from OpenAI Whisper-small on Common Voice 17.0 dataset
Speech Recognition
Transformers Other

W
abduaziz
20
2
Llama SEA LION V3 8B
Llama-SEA-LION-v3-8B is a multilingual large language model optimized for Southeast Asian languages, supporting 11 Southeast Asian languages and having undergone continuous pre-training on approximately 200 billion tokens.
Large Language Model
Transformers Supports Multiple Languages

L
aisingapore
1,964
2
F15
Fish Speech V1.5 is a leading text-to-speech (TTS) model trained on over 1 million hours of multilingual audio data.
Speech Synthesis Supports Multiple Languages
F
cocktailpeanut
5,162
0
Nllb 200 3.3B Ctranslate2
NLLB-200 is a neural machine translation model supporting 200 languages, focusing on translation research for low-resource languages.
Machine Translation Supports Multiple Languages
N
entai2965
25
2
Openlid V2
Gpl-3.0
OpenLID-v2 is a high-coverage, high-performance language identification model supporting 200 language variants, an improved version of OpenLID.
Text Classification
O
laurievb
273
2
EXLMR
Apache-2.0
EXLMR is an extended version of XLM-R that supports new languages by expanding the tokenizer vocabulary to mitigate out-of-vocabulary issues, specifically optimized for low-resource Ethiopian languages.
Large Language Model
Transformers Other

E
Hailay
27
0
XLSR WithLM Malayalam
Apache-2.0
This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the IMaSC, Indic TTS Malayalam, and OpenSLR Malayalam training datasets, supporting automatic speech recognition for Malayalam.
Speech Recognition
Transformers

X
kavyamanohar
19
4
SSA HuBERT Base 60k
A self-supervised speech model based on the HuBERT architecture, specifically optimized for 21 languages in Sub-Saharan Africa with 60,000 hours of training data
Speech Recognition
Transformers

S
Orange
995
11
Poro 34B Chat
Apache-2.0
The Poro 34B Chat version is an instruction-following model fine-tuned based on Poro 34B, supporting bilingual interaction in Finnish and English, jointly developed by Silo AI, the TurkuNLP group, and HPLT.
Large Language Model
Transformers Supports Multiple Languages

P
LumiOpen
465
12
Viking 33B
Apache-2.0
Viking 33B is a 33 billion parameter decoder-only Transformer model supporting Finnish, English, and multiple Nordic languages, with capabilities in code understanding and generation.
Large Language Model
Transformers Supports Multiple Languages

V
LumiOpen
1,030
25
Viking 13B
Apache-2.0
Viking 13B is a 13-billion-parameter multilingual large model supporting Finnish, English, and other Nordic languages, with code processing capabilities
Large Language Model
Transformers Supports Multiple Languages

V
LumiOpen
1,233
12
Viking 7B
Apache-2.0
Viking-7B is a 7-billion-parameter Transformer model specializing in Finnish, Nordic languages, and programming code, trained on 2 trillion tokens.
Large Language Model
Transformers Supports Multiple Languages

V
LumiOpen
2,000
42
Nllb Moe 54b 4bit
NLLB-MoE is a Mixture of Experts machine translation model developed by Meta, supporting 200 languages, and is one of the most advanced open-access machine translation models available.
Machine Translation
Transformers Supports Multiple Languages

N
KnutJaegersberg
17
5
Gpt Sw3 20b Instruct 4bit Gptq
Other
GPT-SW3 is a large-scale Nordic language model developed by AI Sweden, supporting text generation tasks in 5 Nordic languages and English.
Large Language Model
Transformers Supports Multiple Languages

G
AI-Sweden-Models
60
4
Madlad400 10b Mt
Apache-2.0
A general-purpose language model supporting over 100 languages, suitable for various natural language processing tasks.
Large Language Model Supports Multiple Languages
M
google
2,412
110
Madlad400 3b Mt
Apache-2.0
A multilingual processing model supporting over 100 languages, suitable for various natural language processing tasks.
Large Language Model Supports Multiple Languages
M
google
7,035
119
Madlad400 8b Lm
Apache-2.0
A multilingual processing model supporting over 200 languages, suitable for various natural language processing tasks.
Large Language Model Supports Multiple Languages
M
jbochi
71
8
Wav2vec2 Phenome Based Alffaamharic
Apache-2.0
A wav2vec2-based speech recognition model, fine-tuned at the phoneme level for Amharic
Speech Recognition
Transformers

W
Samuael
34
2
Nllb Clip Large Oc
NLLB-CLIP is a multilingual vision-language model combining the NLLB model's text encoder with CLIP's image encoder, supporting 201 languages.
Text-to-Image
N
visheratin
28
2
Nllb Clip Base Oc
NLLB-CLIP is a multilingual vision-language model combining the NLLB text encoder with the CLIP image encoder, supporting 201 languages
Text-to-Image
N
visheratin
371
1
Mms Tts Smo
Samoan text-to-speech model developed by Meta, based on the VITS architecture, supporting high-quality speech synthesis
Speech Synthesis
Transformers

M
facebook
21
0
Mms Tts Kri
Creole text-to-speech model developed by Meta, utilizing VITS end-to-end architecture for high-quality speech synthesis
Speech Synthesis
Transformers

M
facebook
289
0
Mgpt 1.3B Kazakh
MIT
A Kazakh language model based on mGPT-XL (1.3B) with 1.3 billion parameters, specifically optimized for Kazakh
Large Language Model
Transformers Supports Multiple Languages

M
ai-forever
74
7
Nllb 200 Distilled 1.3B Ct2 Int8
A multilingual processing model supporting over 100 languages and scripts, covering from mainstream languages to various dialects and regional variants.
Large Language Model
Transformers Supports Multiple Languages

N
JustFrederik
67
2
Whisper Small Sinhala Fine Tune
Apache-2.0
A speech recognition model fine-tuned on Sinhala language based on OpenAI Whisper-small
Speech Recognition
Transformers

W
Subhaka
78
6
Qazgpt2
A GPT-2 language model trained from scratch on Kazakh news and Wikipedia corpus, supporting Latin script Kazakh text generation and processing.
Large Language Model
Transformers

Q
amandyk
69
5
Nllb Moe 54b
NLLB-Mo is a multilingual machine translation model that supports translation tasks for over 200 languages.
Machine Translation
Transformers Supports Multiple Languages

N
facebook
579
116
Nllb
NLLB-200 distilled version with 600M parameters, supporting machine translation tasks for 200 languages
Machine Translation
Transformers Supports Multiple Languages

N
Narsil
113
2
Small100
MIT
SMaLL-100 is a compact and fast large-scale multilingual machine translation model, covering over 10,000 language pairs, with performance comparable to M2M-100 but smaller in size and faster.
Machine Translation
Transformers Supports Multiple Languages

S
alirezamsh
5,374
81
Labse En Ru Myv V1
The first neural machine translation system for the Erzya language, improved based on the LaBSE model, supporting sentence embedding representations for Russian and Erzya
Text Embedding
Transformers Other

L
slone
17
0
Nllb 200 3.3B
A multilingual processing model supporting over 100 languages and writing systems
Large Language Model
Transformers Supports Multiple Languages

N
facebook
358.62k
315
Asr Wav2vec2 Dvoice Wolof
Apache-2.0
This is an automatic speech recognition model for Wolof, based on the wav2vec 2.0 architecture, trained on the DVoice dataset, supporting Wolof speech transcription.
Speech Recognition Other
A
speechbrain
44
4
Wav2vec2 Large Xls R 300m Finnish
Apache-2.0
This model is a Finnish speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on a general speech dataset.
Speech Recognition
Transformers

W
bekirbakar
18
0
Dvoice Darija
Apache-2.0
This model is an automatic speech recognition system trained on the DVoice Dari dataset, using the wav2vec 2.0 architecture, supporting transcription of Moroccan Arabic dialect.
Speech Recognition Other
D
aioxlabs
22
7
Bloom 560m
Openrail
BLOOM is a multilingual large language model developed by BigScience, supporting 46 natural languages and 12 programming languages, built on the Transformer architecture with 560 million parameters.
Large Language Model Supports Multiple Languages
B
bigscience
664.36k
353
Mbart Large 50 Finetuned Ar Wikilingua
Text summarization model fine-tuned on Arabic wiki_lingua dataset based on mbart-large-50
Text Generation
Transformers

M
ahmeddbahaa
21
0
Opus Mt Tc Big En Gmq
This is a neural machine translation model for translating from English to North Germanic languages (including Danish, Faroese, Icelandic, Norwegian Bokmål, Norwegian Nynorsk, and Swedish), part of the OPUS-MT project.
Machine Translation
Transformers Supports Multiple Languages

O
Helsinki-NLP
372
3
- 1
- 2
- 3
Featured Recommended AI Models